Pengfei Yu

Dual-Pathway Geometry-Aware MLLM for Spatial Intelligence

May 25, 2026
Via arXiv

Self-Consistent Latent Reasoning: Long Latent Sequence Reasoning for Vision-Language Model

May 13, 2026
Via arXiv

When to Think, When to Speak: Learning Disclosure Policies for LLM Reasoning

May 05, 2026
Via arXiv

StreamingClaw Technical Report

Mar 23, 2026
Via arXiv

EXPLORE-Bench: Egocentric Scene Prediction with Long-Horizon Reasoning

Mar 12, 2026
Via arXiv

OSExpert: Computer-Use Agents Learning Professional Skills via Exploration

Mar 09, 2026
Via arXiv

MindWatcher: Toward Smarter Multimodal Tool-Integrated Reasoning

Dec 29, 2025
Via arXiv

Do Language Models Have Bayesian Brains? Distinguishing Stochastic and Deterministic Decision Patterns within Large Language Models

Jun 12, 2025
Via arXiv

Ordered-subsets Multi-diffusion Model for Sparse-view CT Reconstruction

May 15, 2025
Via arXiv

LDGen: Enhancing Text-to-Image Synthesis via Large Language Model-Driven Language Representation

Feb 25, 2025
Via arXiv